Smart speakers

ABSTRACT

This invention relates to a method (and corresponding system) for providing location-aware media information and more specifically a method for providing location-aware audio content by an audio-presenting device capable of presenting audio content. On the audio source ( 401 ) comprising processing means ( 301 ) one or more sensors ( 404 ), may be positioned in order to locate the position(s) of one or more audio-presenting devices attached, close, or distant to said audio source. The sensors are used by receiving signal(s) to determine the location of the available audio-presenting devices. The audio-presenting device&#39;s ( 402, 403 ) location relative to the user&#39;s working position—in front of the display device ( 406 )—may be estimated by the audio source ( 401 ) and thereby provides information items to the audio-presenting devices ( 402, 403 ) by the method according to the present invention so as to provide desired sound signals.

FIELD OF THE INVENTION

This invention relates to a method for providing location-aware mediainformation and more specifically a method for providing location-awareaudio content by an audio-presenting device capable of presenting audiocontent.

The present invention also relates to a system for performing the methodand a computer program for performing the method.

BACKGROUND OF THE INVENTION

DE 196 46 055 discloses an audio playback system comprising areproducing device, a speaker system, and a signal-processing unit forimproving the spatial experience to a listener by applyingpsycho-acoustic signal processing. Physical placement of a speakersystem is assisted by processing the presented audio to e.g. compensatethe speed of audio in air. The audio output from a source is processedwith effects to trick the listening ears in believing that the presentedaudio is coming from a direction where no speaker is actually placed.

This type of audio processing to e.g. virtually expand the size of theroom and/or virtually displace sounds is commonly used in conjunctionwith consumer-related media productions where the size of the roomand/or the number of surrounding speakers are limited. The processed andimaged/mirrored audio does not necessarily reflect the actual placementof musical instruments as they were recorded, but mostly introduces afeel of another location i.e. a concert hall, a church, an outdoorscene, etc. To obtain information of the actual placement of thephysically available speakers in a system it may, however, be necessaryto provide a calibration procedure prior to processing the sound sourceto compensate room characteristics, etc. This calibration may comprisean impulse response for each of the available speakers, where theimpulse response may comprise speaker-independent characteristics suchas group delay and frequency response, etc.

In a special audio-optimized environment, e.g. a soundproofed chamber,such a method may be sufficient to obtain an acceptable impulse responseto desirably render an audio signal.

However, in a real environment, such as a living room or a kitchen,etc., it is a very difficult challenge to obtain authentic impulseresponses to accomplish trustworthiness by the listener due to roomreverberations, background noise, placement of probe microphones etc.during the calibration procedure.

To process the audio optimally with respect to audio placement, it maynot be necessary to inquire impulse responses for the speaker system. Itmay be necessary for the processing unit to know the exact placement ofspeakers and the listener for estimation of acceptable processingschemes.

The human ear tolerates a slight deviation in speaker placement, but itis not possible to convince a listener that a sound is coming from theleft speaker, when it is actually being played from e.g. the rightspeaker. Therefore, to satisfy and convince a listener of a speakerplacement, the speaker actually has to be placed relatively near theintended location of the sound.

For this sake, it may be convenient to physically place a speaker on achosen spot and let this speaker play material that may be appropriatefor this location.

For example, if a speaker playing music is placed close to a listener,the listener may observe a given level of sound. If the speaker isplaced at a longer distance from the listener, the playing speaker mustcarry out more power to let the listener obtain the same sound level aswhen the speaker is placed closer to him.

An example of the use of a speaker system according to the presentinvention could be watching a concert on television where an organ isplaying on the left and a guitar is playing on the right. Positioning anaudio-presenting device on the left would present the sound of theorgan, positioning the audio-presenting device on the right wouldotherwise present the sound of the guitar.

In a stereo system where a left and a right audio signal is representedbut only one loudspeaker placed to the left of a listener is available,it may be desirable to only reproduce the left signal to avoid spatialconfusion of the listener. Likewise, if the loudspeaker is placed infront of the listener, the reproduced audio may comprise an appropriatemix of the left and the right audio channel.

Likewise, this may also be the situation in a surround soundenvironment, where a number of loudspeakers (typically 4 to 6) areplaced around the listener to generate a 3D-like sound image. Thespeaker location is essential to e.g. instrument placement and accuratemirroring of acoustic spaces for high precision sound positioning.Unless e.g. the rear speakers (the speakers positioned behind thelistener) in a surround sound setup are placed exactly symmetricallyrelative to the listener, undesirable effects may be apparent such ase.g. non-uniform sound delay, sound coloration, wave interference, etc.In addition, if the front speakers in a surround sound environment areplaced further apart from the user than the rear speakers, a front/rearbalance control of e.g. an amplifier has to be adjusted to prevent therear speakers from dominating the sound image. However, the soundscoming from the rear speakers still arrive first at the listener by wayof the physically shorter distance. This disadvantage is typicallydisregarded in home theatre arrangements.

A speaker system according to the present invention provides users witha system that enables them to position speakers in a space relative tothe current auditory content without troublesome speaker/amplifieradjustments.

For processing the audio according to the speaker placement, it isnecessary for the sound system to identify the loudspeaker location. Itmay be difficult and sometimes even impossible for a user to enter theexact location of a loudspeaker. Therefore it may be advantageous if thesound system is able to automatically determine the speaker placementprior to signal processing.

With that, the user can add audio-presenting devices without having toenter any software-based set-up programs or adjusting any systemsetting. All the user has to do is position the speaker somewhere withinthe useful area and the processing unit will determine which auditorysignals will be presented through the audio-presenting device.

OBJECT AND SUMMARY OF THE INVENTION

It is an object of the invention to solve the above-mentioned problem ofspeaker placement without user interference.

This is achieved by a method (and corresponding system) of providinglocation-aware audio content by an audio-presenting device capable ofpresenting audio content, the method comprising the steps of obtaining,in a processing unit, at least one location parameter representing thelocation of the audio-presenting device; processing, in said processingunit, current audio content on the basis of the obtained at least onelocation parameter in order to obtain a location-aware audio contentbeing relative to the current audio content dependent on the at leastone location parameter; and presenting the obtained location-aware audiocontent by the audio-presenting device.

It is a further object of the invention to provide a method and systemwherein the processing of audio content comprises processing stepsconsidering audio capabilities of the audio-presenting device.

This invention provides a user with a system that enables him toposition a speaker relative to a current auditory content without havingto consider any programming of speaker placement. The system willdetermine which auditory signals will be presented through the speaker.

An audio-presenting device may be a speaker capable of reproducingaudible signals, as well as signals inaudible to the human ear. Ingeneral, the idea of the present invention covers the automatic transferof location-aware content from a source, i.e. the content of an audiosource, to an audio-presenting device relative to its location.

Said audio source may be a personal computer, a television, a videocamera, a game unit, a mobile phone, etc. capable of detecting saidlocation(s) of an audio-presenting device, and capable of subsequentlytransferring a corresponding content to said audio-presenting device.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows an audio-presenting device connected to an audio source ina basic setup,

FIG. 2 shows a method of presenting content with an audio-presentingdevice,

FIG. 3 illustrates a schematic block diagram of a processing unit in anaudio source,

FIG. 4 shows a setup with two audio-presenting devices with locationreference to a display device,

FIG. 5 shows another embodiment of the present invention,

FIG. 6 illustrates a schematic block diagram of musical instrumentsplaced in a stereophonic reproduction setup,

FIG. 7 illustrates another schematic block diagram of musicalinstruments placed in a quadraphonic reproduction setup.

Throughout the drawings, the same reference numerals indicate similar orcorresponding features, functions, etc.

DETAILED DESCRIPTION OF EMBODIMENTS

FIG. 1 shows an audio-presenting device, here a speaker unit, denoted byreference numeral (101) with one or more transmitters (102) placed infront of a listener denoted by reference numeral (105). On the audiosource (103) one or more sensors, indicated by reference numeral (104),may be positioned in order to locate the position(s) of one or moreaudio-presenting devices attached, close, or distant to said audiosource. The number of sensors are used, by receiving signal(s) sent fromone or more transmitters positioned on or integrated in theaudio-presenting device, to determine the location of theaudio-presenting device(s). In other words, by means of said sensor(s),the audio source may locate said audio-presenting device(s).Subsequently, the audio source may determine information (dependent onsaid location) representing audio content (106) which has to betransferred and presented on said audio-presenting devices.

FIG. 2 shows a method of presenting content with an audio-presentingdevice.

In step 201, the method in accordance with a preferred embodiment of theinvention is started. Variables, flags, buffers, etc., keeping track oflocations, content, information item(s), identifying signal(s), etc.corresponding to the status of audio-presenting devices located relativeto an audio source and corresponding to the status of said audio sourceare set to default values.

In step 202, the audio-presenting device may be connected or attached toan audio source. This will typically be a user action in that the usermay desire that the audio-presenting device may be in operation.

It may be the case that this step is repeated for more audio-presentingdevices. The steps to be followed may then correspondingly apply.

In step 203, at least one transmitter—located on the audio-presentingdevice—preferably transmits a corresponding signal identifying thedevice. As discussed in FIG. 1, one or more transmitters may bepositioned on or integrated in the audio-presenting device. This orthese transmitter(s) may then be used to inform the audio source thatsaid audio-presenting device is connected to it. Said signal may be usedto identify the audio-presenting device, its type and characteristics,etc.

In step 204, at least one sensor may receive at least one identifyingsignal. Said sensor(s) is/are preferably located on the audio source. Asdiscussed in the foregoing step and in FIG. 1, the identifying signal(s)is/are transmitted from one or more transmitters located on theaudio-presenting device.

In step 205, the audio source may obtain a first location of theaudio-presenting device.

In step 206, the audio source may determine, on the basis of obtainedlocation information what content part or parts from the audio contenthas to be processed and played back subsequently on the audio-presentingdevice. It may be the case that this step is repeated for moreaudio-presenting devices. Based on one or more identifying signals, theaudio source may determine specific X, Y, Z coordinates of theaudio-presenting device. Said coordinates may be defined relative to afixed point on the audio source or e.g. a location of the room, etc. andmeasured by it by means of received identifying signals(s).

Said audio content may be electric or acoustic signals, analog, digital,compressed or non-compressed audio, etc. or any combination thereof.

In step 207, the audio parts from step 206 are processed in order toobtain a location-aware audio content relative to the current audiocontent dependent on the at least one location parameter.

In step 208, the audio source may transfer context-aware audio contentto the audio-presenting device. Said first information item may betransferred and then received by means of a network—as a generalsolution known from the prior art—or it may be received by means of anoptimized communication dedicated to the audio-presenting device.

In step 209, the audio-presenting device may receive andpresent/reproduce said context-aware audio content.

The context-aware audio content (presented on said audio-presentingdevices) may further be dependent on what is currently presented on theaudio source, as it may be convenient to present a part of what iscurrently presented on the audio source with e.g. different processingattributes, if any.

Throughout the application—when the wording “presentation”, “present” orthe like is used—it is understood to mean that content may be reproducedon a corresponding audio-presenting device.

The wording “content”, is understood to be audio information typicallyplayed back on a personal computer, a television, a video camera, a gameunit or a mobile phone, etc. Said information or content may be electricsignals, compressed or non-compressed digital signals, etc. or anycombination thereof.

FIG. 3 illustrates a schematic block diagram of an embodiment of anaudio source (301) comprising one or more microprocessors (302) and/orDigital Signal Processors (306), a storage unit (303), and input/outputmeans (304) all connected via a data bus (305). The processor(s) and/orDigital Signal Processor(s) (306) are the interaction mechanism amongthe storage unit (303) and the input/output means (304). Theinput/output means (304) is responsible for communication with theaccessible sensor(s), wherein transport of received location parameters,etc. may occur during operation. Location parameters can be uploadedfrom remote audio-presenting devices via the input/output means (304).This communication between an audio-presenting device and the sensor(s)may take place e.g. by using IrDa, Bluetooth, IEEE 802.11, wireless LAN,etc. but will also be useful in a wired application solution. Thestorage unit (304) stores relevant information like a dedicated computerprogram or uploaded location parameters for determination of availableresources, processing algorithms, etc.

Digital Signal Processors may be dedicated programmed for differentprocessing tasks such as decoding, encoding, effect layering, etc.Either a single multi-issue DSP may comprise several processing means ora multiple of DSPs can be nested to perform processing tasks where eachDSP is dedicated to fewer processing means than the single multi-issuedDSP.

The overall processing may also be comprised in a single general-purposeprocessor comprising software for a multitude of tasks, whereinprocesses are defined among different processing functions. The use ofgeneral-purpose microprocessors, instead of DSPs, is a viable option insome system designs. Although dedicated DSPs are well suited to handlesignal-processing tasks in a system, most designs also require amicroprocessor for other processing tasks such as memory managing, userinteraction, relative location estimation, etc. Integrating systemfunctionality into one processor may be the best way to realize severalcommon design objectives such as lowering the system part count,reducing power consumption, minimizing size, and lowering cost, etc.Reducing the processor count to one also means fewer instruction setsand tool suites to be mastered.

Furthermore, the invention relates to a computer-readable mediumcontaining a program for making a processor carry out a method ofproviding location-aware media content by an audio-presenting device(101) capable of presenting audio content (106), the method comprisingthe steps of obtaining, in a processing unit (103), at least onelocation parameter representing the location of the audio-presentingdevice (101); processing, in said processing unit (103), current audiocontent on the basis of the obtained at least one location parameter inorder to obtain a location-aware audio content being relative to thecurrent audio content dependent on the at least one location parameter;and presenting the obtained location-aware audio content by theaudio-presenting device (101).

In this context, a computer-readable medium may be a program storagemedium i.e. both physical computer ROM and RAM, removable andnon-removable storage drives, magnetic tape, optical disc, digitalversatile disc (DVD), compact disc (CD or CD-ROM), mini-disc, hard disk,floppy disk, smart card, PCMCIA card, information acquired from datanetworks e.g. a local area network (LAN), a wide area network (WAN), orany combination thereof, e.g. the Internet, an intranet, an extranet,etc.

FIG. 4 shows a setup with two audio-presenting devices (402, 403) withlocation reference to a display device denoted by reference numeral(406) all with one or more transmitters (not shown) placed in front of alistener denoted by reference numeral (405). On the audio source (401)comprising processing means (301) one or more sensors, indicated byreference numeral (404), may be positioned in order to locate theposition(s) of one or more audio-presenting devices attached, close, ordistant to said audio source. The sensors are used, by receivingsignal(s) sent from one or more transmitters positioned on or integratedin the audio-presenting devices, to determine the location of theavailable audio presenting devices. The audio-presenting device's (402,403) location relative to the user's working position—in front of thedisplay device (406)—maybe estimated by the audio source (401) andthereby provides information items to the audio-presenting devices (402,403) by the method described hereinbefore to provide desired soundsignals accordingly.

The audio source may be supported by surround-sound technologies capableof sending audio information to individual channels, and therebydifferent audio-presenting devices, to generate a 3d-like sound-image.By gathering location placement parameters of the individualaudio-presenting devices at different locations, appropriate audioprocessing may be executed in order to spatially enhance a listeningexperience.

Correspondingly, the audio-presenting device(s) is/are connectableand/or attachable to the audio sources or may be placed relative to theaudio source and there connected to it, and furthermore, theaudio-presenting device is capable of receiving and presenting contentfrom the audio source.

Another example of an embodiment of the present invention can be seen inFIG. 5 wherein a media content source (501) transmits all availableaudio content without the above-mentioned processing prior totransmission. In this example, content processing is carried out in theaudio-presenting devices (502, 503, 504, 505, 506), a number of devicescomprising processing means (not shown), prior to user presentation.Each audio-presenting device comprises means (not shown) for receivingmedia content transmitted from the content source (501) and means forobtaining location parameters relative to a user (505). The user (505)may wear, or be attached to, location transmitting means (not shown) toinform any audio-presenting devices of its position.

Furthermore, each audio-presenting device may comprise processing meansas described in the foregoing to process the media content accordinglyto the location of the audio-presenting devices relative to the user'sposition.

For example, if the audio-presenting device (503) in front of the userdetermines that it is located directly in front of the user, it may bedetermined by the device that this should reproduce the center channelin a 5.1 surround signal. If, for example, the media content isavailable in stereo only, it may be determined by the frontaudio-presenting device to reproduce an appropriate mix of the left andthe right audio channel, etc.

Furthermore, the processing of media content may comprise capabilitiesof the available audio-presenting devices. For example, if a loudspeakeris only capable of reproducing signals in the frequency range of 10-200Hz, but the media content comprises signals outside that range and i.e.therefore should be reproduced, this audio-presenting device limitationmay be considered in the processing steps. This lack of reproductionpossibility may be compensated in the processing steps by e.g.processing media content for other audio-presenting devices accordingly,if any.

FIG. 6 illustrates a schematic block diagram of musical instrumentsplaced in a stereophonic reproduction setup. The stereo recordingcomprises a guitar on the left channel (602) and a drum set on the rightchannel (603). When placing an audio-presenting device according to theinvention at the far right side (603) of the listener (105), the audiodevice may be configured to only play the sounds coming from the drumset. Placing the audio-presenting device to the far left of the listener(105) may result in presenting only the guitar. If now, for example, theaudio-presenting device placed to the far left is located in the samerelative direction in relation to the listener but this time closer tothe listener, the audio-presenting device may need to turn down theoutput power, in order to obtain an identical volume level of soundreceived by the listener.

FIG. 7 illustrates another schematic block diagram of musicalinstruments placed in a quadraphonic recording setup. Four separatetracks are recorded comprising guitar (602), drum set (603), piano(701), and a violin (702). To reproduce the same ambience duringreproduction as in the recording stage, four audio-presenting devicesplaced around a listener (105) may be required. Similarly to theabove-mentioned stereo recording, every audio-presenting devicereproduces sonic material corresponding to its location. If placedsymmetrically in a quadrant like the instruments in the Figure, everysingle audio device approximately plays back only a single instrument.If, for example, the audio-presenting device in the 3rd quadrant isturned off, no or only a little bit of piano (701) may be found in theacoustic image.

Placing a speaker in the middle of the quadrant may e.g. reproduce allof the four instruments.

While the description above refers to particular embodiments of thepresent invention, it will be understood by those skilled in the artthat many details provided above have been described by way of exampleonly, and modification may be made without departing from the scopethereof.

The accompanying claims are intended to cover such modifications aswould fall within the true scope and spirit of the present invention.The disclosed embodiments are therefore to be considered in all respectsas illustrative and not restrictive, the scope of the invention beingindicated by the appended claims, rather than the foregoing description,and al changes coming within the meaning and range of equivalency of thefollowing claims are therefore intended to be embraced therein

1. A method of providing location-aware media content by anaudio-presenting device (101) capable of presenting audio content (106),the method comprising the steps of: obtaining, in a processing unit(103), at least one location parameter representing the location of theaudio-presenting device (101) using a wireless communication between theprocessing unit and the audio-presenting device; processing, in saidprocessing unit (103), current audio content on the basis of theobtained at least one location parameter in order to obtain alocation-aware audio content being relative to the current audio contentdependent on the at least one location parameter; and presenting theobtained location-aware audio content by the audio-presenting device(101).
 2. A method as claimed in claim 1, wherein the processing unit(103) comprises the steps of: receiving the at least one locationparameter from the audio-presenting device (101); and transmitting theobtained location-aware audio content to the audio-presenting device(101) prior to presenting the same.
 3. A method according to claim 1,wherein the processing unit is comprised by an audio-presenting device(502, 503, 504, 505, 506), and comprises the steps of: receiving thecurrent audio content; and presenting the obtained location-aware audiocontent by the audio-presenting device (502, 503, 504, 505, 506).
 4. Amethod according to claim 1, wherein said at least one locationparameter is determined as a parameter relative to a user's workspace.5. A method according to claim 1, wherein the steps of processing audiocontent comprise processing by using audio reproduction capabilities ofthe audio-presenting device.
 6. A system for providing location-awaremedia content by an audio-presenting device (101) capable of presentingaudio content (106), the system comprising means for: obtaining, in aprocessing unit (103), at least one location parameter representing thelocation of the audio-presenting device (101) using a wirelesscommunication between the processing unit and the audio-presentingdevice; processing, in said processing unit (103), current audio contenton the basis of the obtained at least one location parameter in order toobtain a location-aware audio content being relative to the currentaudio content dependent on the at least one location parameter; andpresenting the obtained location-aware audio content by theaudio-presenting device (101).
 7. A system according to claim 6, whereinthe processing unit (103) comprises means for: receiving the at leastone location parameter from the audio-presenting device (101); andtransmitting the obtained location-aware audio content to theaudio-presenting device (101) prior to presenting the same.
 8. A systemaccording to claim 6, wherein the processing unit is comprised by anaudio-presenting device (502, 503, 504, 505, 506), and comprises meansfor: receiving the current audio content; and presenting the obtainedlocation-aware audio content by the audio-presenting device (502, 503,504, 505, 506).
 9. A system according to claim 6, wherein said at leastone location parameter is determined as a parameter relative to a user'sworkspace.
 10. A system according to claim 6, wherein the steps ofprocessing audio content comprise processing by using audio reproductioncapabilities of the audio-presenting device.
 11. A computer-readablemedium containing a program for making a processor carry out the methodof claim 1.